Picture for Yulan Hu

Yulan Hu

Multi-Stakeholder LLM Alignment: Decomposing Estimation from Aggregation

Add code
May 26, 2026
Viaarxiv icon

GroupTravelBench: Benchmarking LLM Agents on Multi-Person Travel Planning

Add code
May 24, 2026
Viaarxiv icon

TRACE: Distilling Where It Matters via Token-Routed Self On-Policy Alignment

Add code
May 11, 2026
Viaarxiv icon

Aligning Agents via Planning: A Benchmark for Trajectory-Level Reward Modeling

Add code
Apr 09, 2026
Viaarxiv icon

Learn More with Less: Uncertainty Consistency Guided Query Selection for RLVR

Add code
Jan 30, 2026
Viaarxiv icon

No More Stale Feedback: Co-Evolving Critics for Open-World Agent Learning

Add code
Jan 11, 2026
Viaarxiv icon

TravelBench: A Broader Real-World Benchmark for Multi-Turn and Tool-Using Travel Planning

Add code
Jan 05, 2026
Viaarxiv icon

AMAP Agentic Planning Technical Report

Add code
Dec 31, 2025
Viaarxiv icon

TravelBench: A Real-World Benchmark for Multi-Turn and Tool-Augmented Travel Planning

Add code
Dec 27, 2025
Viaarxiv icon

Towards Reward Fairness in RLHF: From a Resource Allocation Perspective

Add code
May 29, 2025
Viaarxiv icon